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Abstract 

Objective:\Ne performed an updated meta-analysis, using a comprehensive strategy of a logistic regression and a model- 
free approach, to evaluate more precisely the role of the rs4444235 variant near the Bone morphogenetic protein-4 (BMP4) 
gene in susceptibility to colorectal cancer (CRC). 

Methods: t\ total of 19 studies with 28770 cases and 28234 controls were included. Metagen system with logistic regression 
was applied to choose the most plausible genetic model for rs4444235. Generalized odds ratio (ORq) metric was used to 
provide a global test of relationship between rs4444235 and CRC risk. 

Results: Metagen analysis suggested the rs4444235 fitted best to an additive model. In assessment of the additive model, 
heterogeneity was observed (P= 0.059, /^ = 36.1), and pooled per-allele OR was 1.08 (95% CI = 1.05-1.1 1). Based on the 
model-free approach, pooled ORq was 1 .09 (95% CI = 1 .05-1 .1 4) under a random-effect model. Stratified analyses suggested 
heterogeneity could be in part explained by population ethnicity, study design, sources of controls, and sample size. 
Sensitivity analysis further supported the robust stability of the current results, by showing similar pooled estimates before 
and after sequential removal of each study. 

Conclusions:'\h\s meta-analysis provides a robust estimate of the positive association between the rs4444235 and CRC risk 
and further emphasizes the importance of the rs4444235 in CRC risk prediction. 
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Introduction 

Colorectal cancer is a major public health issue in developed 
countries and is becoming increasingly prevalent in Asia and 
Africa, with over 1.2 million new cases worldwide each year [1]. 
As other complex diseases, colorectal cancer is a complex trait 
driven by diverse etiologies involving in multiple environmental 
and genetic factors and their interactions [2] . Twin- and familial- 
based studies have provided clear evidence that approximately 
35% of all CRC cases have a genetic component [2]. Of all CRC 
cases, <5% can be accounted by a combination of some germline 
mutations with high penetrance, whereas most "sporadic" cases 
are due to large numbers of common variants with individually 
small effects [3]. 

Recently, genome-wide association studies (GWAS) have 
implicated multiple common single nucleotide polymorphisms 



(SNPs) in inherited predisposition to CRC [4,5]. The SNP 
rs4444235 at chromosome 14q22.2, mapping 9.4 kb upstream 
region of the gene encoding bone morphogenetic protein 4 
{BMP4), was firstly reported by a meta-analysis of GWAS data to 
be associated with CRC risk, with a combined OR of 1 . 1 1 (95 % 
CI = 1.08-1.15, P=8.1xl0"'") [6]. BMP4 is an important 
member of the BMP signaling pathway, which involves in CRC 
development through regulation of colorectal stem cell difFeren- 
tiation [7] . This SNP has been proposed to act as a cis-regulator of 
BMP4 and thus conferred to CRC risk [6]. However, the following 
rephcation studies yielded inconsistent results, in part due to 
"winner curse" in the original report [8], "Proteus phenomenon" 
in rephcation data [9], heterogeneous ethnical population, and 
insufficient statistical power, among other issues. 

Meta-analysis, by integrating published data, may be a powerful 
tool to clarify the inconsistencies across individual studies. Two 
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meta-analyses have been performed to assess rs4444235 in CRC. 
The meta-analysis by Li et al. [10], including 19893 cases and 
22106 controls, assessed multiple genetic models for the 
rs4444235, which would lead to multiple comparisons or 
erroneous mode specification without priori biological evidence. 
The other meta-analysis by Theodoratou et al. [1 1], including less 
samples (18607 cases and 19576 controls), utilized a maximum 
likelihood estimator to decipher plausible model for the 
rs4444235. However, in this meta-analysis, there was no subgroup 
analysis undertaken. To overcome the above mentioned short- 
comings in the previous meta-analyses, we integrati^d published 
data from 28770 cases and 28234 controls, and performed an 
updated meta-analysis, using a comprehensive statistical strategy. 
The methodology of logistic regression was applied to estimate the 
most plausible genetic model in the metagen .system [12]. The 
generalized odds ratio, based on model-free approach, was utilized 
to provide a global test of genetic association [13]. Stratified 
analyses were further performed to explore potential sources of 
heterogeneity. The core aim of this meta-analysis was to provide a 
more precise and robust evaluation for the role of rs4444235 
polymorphism in genetic susceptibility of colorectal cancer. 

Materials and Methods 

Search Strategy and Identification of Relevant Studies 

This meta-analysis were conducted according to the Preferred 
Reporting Items for Systematic Reviews and Meta-analyses 
(PRISMA) statement (Checklist SI) [14]. Genetic association 
studies regarding rs4444235 and colorectal cancer (CRC) risk were 
searched in the PubMed/MEDLINE and EMBASE databases 
through October 15, 2013, by using the combinations of the 
keywords: ("BMP4" or "rs4444235" or "14q22.2") and ("colo- 
rectal cancer" or "Colorectal neoplasmor" or "colon cancer" or 
"rectal cancer"). The similar search terms was also used for the 
WANFANG DATA and CNKI databases. The search was 
supplemented by review of reference lists for all relevant studies 
and review articles. AU relevant reports identified were included 
without language restriction. 

The following inclusion criteria should be fulfilled: (1) either 
case-control or nested case-control studies; (2) clear definition of 
colorectal cancer cases; (3) studies evaluating relationship between 
rs4444235 and CRC risk; (4) providing sufficient data to re- 
calculate the effect metrics, that was, numbers of genotypes in 
cases and controls. The authors were contacted via E-mail when 
eligible articles reported insufficient data. If they were unable to 
provide detailed data, those articles were ex( luded. Animal studies, 
reviews, conference abstracts, editorials and letters were excluded. 
If more than one ethnical population were in one report, each 
population was considered separately. Studies overlapping with 
other studies should be excluded, and the one with the most 
completed information was included. The first study on the 
association of rs4444235 by Houlston et al. was excluded [6], due 
to overlaps with the study by Tomlinson et al. [5] . The latter was 
chosen because of the larger sample. 

Data Extraction 

Data were extracted independently and in duplicate by 2 
reviewers (L. Liu & Q. Su). The following data was extracted from 
each article according to a fixed protocol: the first author, 
publication year, study design, country, ethnicity, source of 
controls, numbers of cases and controls, mean age of cases, sex 
ratio, site/type of colorectal cancer, genotyping method, minor 
allele frequency (MAF), and frequency of genotypes in cases and 
controls. 



Statistical Analysis 

Hardy- Weinberg equilibrium in controls was re-analyzed using 
the goodness-of-fit test (-P>0.05). The inverse variance method 
was apphed to estimate the pooled frequency of the risk allele (the 
C allele) in various ethnical populations. The genetic effect of the 
rs4444235 in CRC susceptibility was assessed using the approach- 
es described as below: 

Metagen system has provided a general framework to decipher 
the most plausible genetic model for the rs4444235 that treated the 
genotypes as independent variables in a logistic regression under 
both fixed and random effects models [12]. Under fixed-effect 
model, two parameters, 02 and Os were estimated using the logistic 
regression: logit (TCj) = a,- +02Ziz^63Zi3, where a,- was the indicator of 
study-specific fixed-effect, ORxc/tt = exp(92), and ORcc/ 
I T = exp(0.(). In order to account for an additive component of 
heterogeneity, a random-effect logistic regression was performed 
using the GLLAMM module in STATA software via introducing 
a study-specific random coefficient: logit (7t^) = a, +(6^+^12)^2+ 
{Os+Visjzis- The most plausible genetic model was determined using 
the following procedure: if 62 = 9 j = 0, no significant genetic- 
association was suggested; if 6^ = 0 and 6i>0, a recessive genetic 
model was suggested; if 62 = 9j>0, a dominant model was 
suggested; if 6^f>62>0, a co-dominant model was suggested; if 
262 — 63, an additive model was likely. In this meta-analysis, the 
genetic model of rs4444235 was best fitted with an additive model. 
Then the per-allele OR of the C allele (additive model ) with 
corresponding 95% confidence interval (95'K) CI) was estimated in 
a logistic regression model, by assigning scores of 0, 1, and 2 to the 
AA, AC and CC genotypes, respectively. Between-study hetero- 
geneity was assessed by the Cochran's based test and f 
metric. If there was no heterogeneity (i.e., if the test was 
significant [P<0.1] or f was less than 25%), a fixed-effect model 
was used to pool the estimate; otherwise, a random-effect model 
was applied. To explore the sources of heterogeneity, stratified 
analyses were performed, if feasible, according to population 
ethnicity (Asians, Caucasians, and Africans), sources of controls 
(population- and hospital-based), study design (GWAS and 
replication study), and total sample size (<2000 and >2000). 

Additionally, the generalized OR (ORq), based on a genetic 
model-free approach, was also introduced in this meta-analysis 
[13]. The ORg utilized the complete genotype distribution to 
provide an estimate of overall gene-disease relationship, given that 
the mutational load was treated as a graded exposure. Heteroge- 
neity was also assessed for ORq metric and stratified analysis was 
also performed. 

Sensitivity analysis was performed to assess the influence of 
single study on pooled estimates. Pubhcation bias was tested by the 
Egger's regression test and Begg's funnel plot. Statistical analyses 
were conducted in ORGGASMA, metan and metagen modules in 
STATA software version 13.0. A P value of <0.05 was considered 
statistically significant, except for estimation of between-study 
heterogeneity, where a significant level of 0.10 was applied. 

Results 

The Characteristic of Included Studies 

Figure 1 shows a flow diagram of the study selection process. 
The comprehensive search yielded 56 potentially relevant 
references. 18 articles were determined to be initially eligible by 
screening tides and abstracts. After further detailed evaluation, 7 
duplicated articles [6,15,16,17,18,19,20] and 3 articles with 
insufficient data [20,21,22,23] were excluded. 1 article was 
excluded due to small sample size (92 cases and 96 controls) 
[24] . 1 study in the article by Tomlinson et al. was excluded due to 
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deviation with Hardy- Weinberg equilibrium [5] . Finally, a total of 
7 articles with 19 studies of 28770 cases and 28234 controls were 
included in this meta-analysis [5,25,26,27,28,29,30]. The charac- 
teristics of these studies were summarized in Table 1 . Among the 
included studies, 15 studies were performed in Caucasians, 3 
studies in Asians, and 1 study in Africans. 

Pooled Frequency of the Risk Allele (the C Allele) in 
Controls According to Ethnicity 

Significant heterogeneity was seen both in Caucasians and 
Asians, and thus the random-effect model was applied (all P< 
0.0001, / = 82.21 and 92.40, respectively). The pooled frequency 
of the C aUele was 0.463 (95% CI = 0.452-0.474) in Caucasians, 
similar to that of 0.477 (95% CI = 0.423-0.532) in Asians. Only 1 
study was conducted in Africans, and the frequency of the C allele 
was 0.334. 

Overall Meta-analysis of the rs4444235 and Colorectal 
Cancer Risk 

Table 2 summarizes the results of overall meta-analysis. In the 
metagen analysis, the pooled ORtc/tt aid ORcc/tt were 1.08 
(95% CI= 1.03-1.12) and 1.18 (95% CI= 1.12-1.25), respective- 
ly, suggesting an additive model as the most plausible genetic 
model. Then the additive model for the rs4444235 was assessed 
using traditional method. In the additive model, heterogeneity was 
observed (P= 0.059, f = 36.1), and thus the random-effect model 
was applied. The variant was significantly associated with 
increased CRC risk, with a pooled per-allele OR of 1.08 (95% 
CI = 1.05-1.1 1; Figure 2). Based on the model-free approach, 
heterogeneity was also seen (P= 0.063, /=35.6). Under the 
random-effect model, signilicant result was also produced for the 



association of rs4444235 and CRC risk, with a pooled ORq of 
1.09 (95% CI =1.05-1.14). 

Stratification Analysis of the rs4444235 and Colorectal 
Cancer Risk 

When performed stratified analysis by population ethnicity, in 
Caucasian subgroup of 15 studies, heterogeneity was removed, 
and the significant association of the rs4444235 still existed for 
both additive model and ORq assessment (Table 2). However, in 
Asians of 3 studies, there was significant heterogeneity (P= 0.040 
and 0.041 for additive model and ORq, respectively), and no 
significant association was found. 

According to the sources of controls, in the population-based 
subgroup of 1 3 studies, analysis of the additive model and ORq 
both showed significant association of rs4444235 with CRC 
without evidence of heterogeneity, whereas in the hospital-based 
subgroup of 8 studies, significant heterogeneity was observed and 
no significant association was reported. 

Regarding to study design, there were 6 GWAS and 13 
replication studies. When assessing the additive model and ORq 
metric, both subgroups showed the positive genetic association 
with CRC risk, without evidence of heterogeneity. Interestingly, 
the pooled estimates in the GWAS (per-allele OR =1.12; 
ORg= 1.14) wc-rc slightly larger than those in the subgroup of 
replication studies (per-allele OR= 1.06; ORg= 1.07). 

The stratified analysis was also conducted according to total 
sample size (numbers of both cases and controls), into 2 subgroups: 
the large sample size subgroup (total sample size >2000) with 
22064 cases and 20876 controls and the small or moderate size 
subgroup (total sample size £2000) with 6706 cases and 7358 
controls. For both additive model and ORq analyses, heteroge- 



56 potentially relevant articles 
identified up to October 20 1 3 



18 artides retiieved f <^ further evaluatioa 



38 articles excluded (by title aad abstract): 

1 1 not related to rs4444235 and CRC 

1 related to rectal adenoma 

2 related to cancer lesions 

4 related to clinic outcome 
2 Meta-analysis 

13 reviews 

5 conference abstracts 



1 1 articles and 1 study excluded; 
3 lacked genotype data 
7 overlapped with other studies 
1 article with small sample size 

(92 cases and 96 controls) 
1 study deviated from ITWE 

7 articles with 19 studies included: 

15 studies in Caucasians 
3 studies in Asians 
1 study in Africans 



Figure 1. Flow chart of study selection process. 

doi:1 0.1 371/journal.pone.01 001 SB.gOOl 
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Study ID 



OR (95% CI) Weight % 



Fernandez (EPICOLON), 2010 

von Hoist, 2010 
Xiong (Beijing), 2010 
Kupfer(UC),2010 
Kupfer(UNC),2010 
Ho(HK),20II 
Tomlinson(LIKl),20II 
Tomlinson(SCOTl),2011 
Tomlinson (SCOT2),2011 
Tomlinson (VQ58),2011 
Tomlinson (CCFR),201 1 
Tomlinson (AU),2011 
Tomlinson (HEL),20 11 
Tomlinson (SEARCH),201 1 
Tomlinson (COIN/NBS),201 1 
Tomlinson (UK3),2011 
Tomlinson (SCOT3),201 1 
Tomlinson (UK4),2011 
Li (Jiangxi),2012 

Overall (I-squared = 36. 1 %, p = 0.059) 

NOTE: Weights are from random effects analysis 
1 

- - - - ^ 




Figure 2. The forest plot of the association between rs4444235 and colorectal cancer risk in the additive model. 

doi:1 0.1 371/journal.pone.01 001 33.g002 



neity was removed in the subgroup with large sample size, whereas 
in the small or moderate size subgroup, heterogeneity stUl existed. 
Both subgroups showed the significant association between the 
rs4444235 and CRC risk. 

Sensitivity Analysis and Publication Bias Assessment 

Since between-study heterogeneit)' was observed in this meta- 
analysis, we further performed sensiti\'ity analysis under the 
random-effect model. For the additive model, the sensitivity 
analysis, by sequentially omitting each study, reported a series of 
pooled OR with 95% CI exceeding 1.00, and the pooled ORs 
were similar before and after omitting each study (Table 3). 
Similar results were suggested for ORq analysis that no single 
study significantiy altered the pooled ORq. In the Begg's and the 
Egger's tests, there was no evidence of publication bias for both 
additive model and ORq (aU P values for Begg's and Egger's tests 
>0.05). 

Discussion 

Currently, traditional meta-analyses of genetic association 
studies are usually performed by collapsing genotypes in two 
categories assuming various genetic models. However, these 
different models are not independent, and a priori biological 
justification for the choice of a specific model is seldom available 



[31]. Additionally, interpretation of these results is complicated 
since a set of different estimates and significance tests are usually 
provided. In this current meta-analysis of rs4444235 and 
colorectal cancer risk, we utilized a comprehensive strategy, 
including the metagen analysis based on logistic regression and 
OR<3 metric based on model-free approach [12,13], to overcome 
the drawbacks in traditional meta-analysis of erroneous model 
specification and multiple model tests with an inflated T^pe I error 
rate, and make the interpretation of the current results easier. 

In this meta-analysis of 19 case-control studies of 28770 cases 
and 28234 controls, the metagen analysis indicated that the 
rs4444235 fitted best to an additive model. Knowledge of the best- 
fitting model for the rs4444235 may be important in optimizing 
the use of this SNP in colorectal cancer (CRC) risk prediction. 
Assessment of additive model indicated that CRC risk was 
increased by 8% per extra C allele. Based on model-free approach, 
the generalized OR (ORq) analysis showed that CRC cases with 
higher mutational load than healthy individuals have 9% higher 
risk for CRC susceptibility. Sensitivity analysis further supported 
the current results, by showing similar ORs before and after 
sequentially omitting single study. The positive association of the 
rs4444235 with CRC risk identified by this meta-analysis was also 
concordant with the findings of previous meta-analyses [10,11]. 

rs4444235 is 9.4 kb from the transcription start site of the 
BMP4. The BMP signaling has vital function in maintenance of 
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Table 2. Meta-analysis of rs4444235 and colorectal cancer risk. 



study characteristics 




Cases/controls 


Genetic model 


OR (95%Ci) 


F '%) 


Pfox heterogeneity 


Total (N = 19) 




28770/28234 


Additive Model 


1.08 (1.04-1 


.11) 


36.1 


0.059 








ORg 


1.09 (1.05-1 


.14) 


35.6 


0.063 


Ethnicity 


Caucasian (N = 15) 


25026/24217 


Additive Mode! 


1.08 (1.05-1 


.11) 


11.8 


0.321 




Asian {N = 3) 


3031/3102 


Additive Mode! 


1.11 (0.95-1 


.31) 


68.9 


0.040 




African (N = 1) 


713/915 


Additive Mode! 


0.89 (0.77-1 


.04) 








Caucasian (N = 15) 




ORg 


1.10 (1.06-1 


.14) 


11.7 


0.322 




Asian {N = 3) 




ORg 


1.14 (0.94-1 


.37) 


68.8 


0.041 




African (N = 1) 




ORg 


0.88 (0.74-1 


.05) 






Sources of controls 


Population based {N = 13) 


23807/22990 


Additive Mode! 


1.08 (1.05-1 


.12) 


16.6 


0.277 




Hospital based (N = 6) 


4963/5244 


Additive Mode! 


1.05 (0.95-1 


.15) 


59.5 


0.030 




Population based {N=13) 




ORg 


1.10 (1.06-1 


.14) 


15.3 


0.290 




Hospital based (N = 6) 




ORg 


1 .UD tu.y4- 


.18) 


59.3 


0.031 


Study design 


GWAS (N = 6) 


7325/8109 


Additive model 


1.12 (1.06-1 


.18) 


33.4 


0.185 




Replication (N = 13) 


21445/20125 


Additive model 


1.06 (1.05-1 


.11) 


33.6 


0.114 




GWAS (N = 6) 




ORg 


1.14 (1.07-1 


.22) 


31.0 


0.203 




Replication (N = 13) 




ORg 


1.07 (1.03-1 


.12) 


33.3 


0.116 


Total sample size 


<2000 (N = 10) 


6706/7358 


Additive model 


1.11 (1.03-1 


.19) 


57.6 


0.012 




>20000 (N = 9) 


22064/20876 


Additive model 


1.07 (1.05-1 


.11) 


0.0 


0.674 




<2000 (N = 10) 




ORg 


1.13 (1.03-1 


.23) 


57.1 


0.013 




>20000 (N = 9) 




ORg 


1.08 (1.05-1 


.12) 


0.0 


0.666 



Abbreviations: GWAS, genome-wide association study; OR, odds ratio; 95% CI, 95% confidence interval; ORg, generalized OR. 
doi:l 0.1 371 /journal.pone.01 001 33.t002 



Table 3. Sensitivity analysis of rs4444235 and colorectal cancer risk. 



Omitted study Additive model ORq 





OR (95% CI) 


If 




OR (95% CI) 


If 


F 


Fernandez (EPICOLON) [25] 


1.08 ( 


1.05- 


.12) 


0.059 


37.0 


1.10 ( 


1.06- 


.14) 


0.065 


35.9 


von Hoist [26] 


1.08 ( 


1.05- 


.12) 


0.082 


33.6 


1.10 ( 


1.06- 


.14) 


0.091 


32.4 


Xiong (Beijing) [27] 


1.08 ( 


1.04- 


.12) 


0.043 


39.6 


1.09 ( 


1.05- 


.14) 


0.046 


39.0 


Kupfer (UC) [29] 


1.08 ( 


1.05- 


.12) 


0.173 


23.8 


1.10 ( 


1.06- 


.14) 


0.174 


23.7 


Kupfer (UNO [29] 


1.08 ( 


1.04- 


.11) 


0.045 


39.3 


1.09 ( 


1.05- 


.14) 


0.048 


38.7 


Ho (HK) [28] 


1.08 ( 


1.05- 


.12) 


0.062 


36.4 


1.10 ( 


1.06- 


.14) 


0.066 


35.8 


Tomlinson (UKl) [5] 


1.07 ( 


1.04- 


.10) 


0.159 


25.2 


1.08 ( 


1.05- 


.12) 


0.164 


24.6 


Tomlinson (SC0T1) [5] 


1.08 ( 


1.04- 


.11) 


0.053 


37.9 


1.09 ( 


1.05- 


.13) 


0.056 


37.4 


Tomlinson (SC0T2) [5] 


1.08 ( 


1.04- 


.11) 


0.051 


38.3 


1.09 ( 


1.05- 


.14) 


0.055 


37.5 


Tomlinson (VQ58) [5] 


1.08 ( 


1.05- 


.12) 


0.062 


36.5 


1.10 ( 


1.06- 


.14) 


0.065 


36.0 


Tomlinson (CCFF!) [5] 


1.08 ( 


1.04- 


.11) 


0.047 


38.9 


1.09 ( 


1.05- 


.14) 


0.051 


38.3 


Tomlinson (AU) [5] 


1.08 ( 


1.04- 


.11) 


0.057 


37.2 


1.09 ( 


1.05- 


.13) 


0.058 


37.0 


Tomlinson (HEL) [5] 


1.08 ( 


1.04- 


.11) 


0.060 


36.7 


1.09 ( 


1.05- 


.13) 


0.064 


36.2 


Tomlinson (SEARCH) [5] 


1.08 ( 


1.05- 


.12) 


0.049 


38.5 


1.10 ( 


1.05- 


.14) 


0.052 


38.0 


Tomlinson (COIN/NBS) [5] 


1.08 ( 


1.04- 


.12) 


0.043 


39.7 


1.09 ( 


1.05- 


.14) 


0.046 


39.1 


Tomlinson (UK3) [5] 


1.08 ( 


1.04- 


.12) 


0.045 


39.3 


1.10 ( 


1.05- 


.14) 


0.047 


38.9 


Tomlinson (SC0T3) [5] 


1.08 ( 


1.04- 


.11) 


0.052 


38.0 


1.09 ( 


1.05- 


.14) 


0.054 


37.7 


Tomlinson (UK4) [5] 


1.08 ( 


1.04- 


.11) 


0.045 


39.2 


1.09 ( 


1.05- 


.14) 


0.049 


38.6 


Li (Jiangxi) [30] 


1.07 ( 


1.04- 


.11) 


0.145 


26.5 


1.09 ( 


1.05- 


.13) 


0.153 


25.7 



Abbreviations: OR, odds ratio; 95% CI, 95% confidence interval; ORg, generalized OR. 
*P values for heterogeneity were calculated by the Cochran's based Q test. 
doi:1 0.1 371/journal.pone.01 001 33.t003 
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Wnt signaling to inhibit differentiation of stem cell near colorectal 
crypt bases [7] . Heightened expression of BMP pathway members 
would restrain the Wnt signaling, subsequently activate P-catenin 
and elevate cells susceptibility to tumor-causing mutations, and 
ultimately promote colorectal carcinogenesis [7] . Intriguingly, in a 
recent study, luciferase reporter assay suggested the element to 
which rs4444235 maps acts as an aUele-specific transcriptional 
enhancer [23]. In CRC cell lines aUele-specific expression analysis 
indicated a significant association of increased BMP4 expression 
with the C allele [23]. These data have strongly supported the 
functional role of rs4444235 in CRC development through the cis- 
acting regulatory influence on BMP4 expression. 

Heterogeneity is a pervasive and difficult problem in meta- 
analysis of genetic association studies. Not surprisingly, heteroge- 
neity existed in this meta-analysis, and thus the findings should be 
interpreted with caution. Nevertheless, in stratified analysis by 
ethnicity, heterogeneity was removed in Caucasians and signifi- 
cant association of rs4444235 retained. According to study design, 
both in GWAS and replication studies, heterogeneity was 
effectively decreased, and association was also existed. Interest- 
ingly, the subgroup of GWAS yielded larger pooled ORs than that 
in repUcation data, indicating "winner curse" existed for the 
rs4444235 in GWAS. In regarding to sample size, only in the 
subgroup with large sample size heterogeneity was removed, but 
both subgroups showed significant genetic association. When 
stratified by sources of txjntrols, heterogeneity was removed in 
population-based subgroup. These findings suggested the hetero- 
geneity could be in part explained by the distinct natures of 
population ethnicity, control sources, study design, and sample size 
across individual studies. Furthermore, no single study had 
significant influence on the overall estimates in sensiti\ ity analysis, 
and no publication bias was observed in this meta-analysis, 
suggesting the robust stability of the current results. 
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